PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG037207t2
Common NameTCM_037207
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 813aa    MW: 89359.5 Da    PI: 6.0006
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG037207t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.88.9e-192078357
                      --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
          Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                      k  ++t+eq+++Le+l++++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Thecc1EG037207t2 20 KYVRYTPEQVDALERLYHECPKPSSMRRQQLIRECpilaNIEPKQIKVWFQNRRCREKQ 78
                      5679*****************************************************97 PP

2START185.62.7e-581643722205
                       HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEE CS
             START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galq 92 
                       +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++  v+e+l+d++ W ++++++++++v+s+g  g+++
  Thecc1EG037207t2 164 IAEETLTEFLSKATGTAVEWVQMPGMKPGPDSIGIVAISHGCTGVAARACGLVGLDPT-RVAEILKDRPSWFRDCRAVDVMNVLSTGngGTIE 255
                       789*******************************************************.8888888888************************ PP

                       EEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHH CS
             START  93 lmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwll 181
                       l +++l+a+++l+p Rdf+ +Ry+  l++g++v++++S++++q+ p+    +++vRae+lpSg+li+p+++g+s +++v+h+dl+ ++++++l
  Thecc1EG037207t2 256 LLYMQLYAPTTLAPaRDFWLLRYTSVLEDGSLVVCERSLNNTQNGPSippAANFVRAEMLPSGYLIRPCEGGGSIIHIVDHMDLEPWSVPEVL 348
                       **********************************************9999******************************************* PP

                       HHHHHHHHHHHHHHHHHHTXXXXX CS
             START 182 rslvksglaegaktwvatlqrqce 205
                       r+l++s++  ++kt++a+l+++++
  Thecc1EG037207t2 349 RPLYESSTLLAQKTTMAALRHLRQ 372
                       *******************99876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007115.7581579IPR001356Homeobox domain
SMARTSM003896.8E-161783IPR001356Homeobox domain
CDDcd000867.37E-172080No hitNo description
SuperFamilySSF466897.7E-172083IPR009057Homeodomain-like
PfamPF000462.3E-162178IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.609.4E-192278IPR009057Homeodomain-like
CDDcd146861.10E-672111No hitNo description
PROSITE profilePS5084826.368154382IPR002913START domain
CDDcd088751.60E-84158373No hitNo description
SuperFamilySSF559614.81E-38163375No hitNo description
Gene3DG3DSA:3.30.530.204.1E-23163369IPR023393START-like domain
SMARTSM002341.5E-44163373IPR002913START domain
PfamPF018528.4E-56164372IPR002913START domain
PfamPF086709.5E-28698792IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0008284Biological Processpositive regulation of cell proliferation
GO:0009733Biological Processresponse to auxin
GO:0010067Biological Processprocambium histogenesis
GO:0010072Biological Processprimary shoot apical meristem specification
GO:0010089Biological Processxylem development
GO:0045597Biological Processpositive regulation of cell differentiation
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 813 aa     Download sequence    Send to blast
MMAVTSSCKE GNKIAMDNGK YVRYTPEQVD ALERLYHECP KPSSMRRQQL IRECPILANI  60
EPKQIKVWFQ NRRCREKQRK EASRLQAVNR KLTAMNKLLM EENDRLQKQV SQLVYENSYF  120
RQQTQNATLA TTDTSCESVV TSGQHHLTPQ HPPRDASPAG LLSIAEETLT EFLSKATGTA  180
VEWVQMPGMK PGPDSIGIVA ISHGCTGVAA RACGLVGLDP TRVAEILKDR PSWFRDCRAV  240
DVMNVLSTGN GGTIELLYMQ LYAPTTLAPA RDFWLLRYTS VLEDGSLVVC ERSLNNTQNG  300
PSIPPAANFV RAEMLPSGYL IRPCEGGGSI IHIVDHMDLE PWSVPEVLRP LYESSTLLAQ  360
KTTMAALRHL RQISQEISQP NVTGWGRRPA ALRALSQKLS KGFNEAVNGF TDEGWSMLES  420
DGVDDVTLLV NSSPGKMMGI NLSYSNGFPS MGNAVLCAKA SMLLQNVPPA ILLRFLREHR  480
SEWADSGIDA YSAAAVKAGP CSLPVSRGGS FGGQVILPLA HTIEHEEFME VIKLENMGHY  540
RDDMIMPGDI FLLQLCSGVD ENAVGTCAEL IFAPIDASFS DDAPIIPSGF RIIPLDSGMD  600
ASSPNRTLDL ASTLEVGAAG NRATGDHSGR CGSTKSVMTI AFQFVYEIHL QENVATMARQ  660
YVRSIIASVQ RVALALSPSR FGSLADFRTP PGTPEAQTLG RWICDSYRCY LGVELLKNEG  720
SESILKMLWH HTDAVLCCSL KALPVFTFAN QAGLDMLETT LVALQDISLE KIFDENGRKA  780
LFAEFPQVMQ QLIIVSPLFW FLVEQGFMCL QGG
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007012153.10.0Class III HD-Zip protein 8 isoform 2, partial
SwissprotQ391230.0ATHB8_ARATH; Homeobox-leucine zipper protein ATHB-8
TrEMBLA0A061GKU30.0A0A061GKU3_THECC; Class III HD-Zip protein 8 isoform 2 (Fragment)
STRINGPOPTR_0006s25390.10.0(Populus trichocarpa)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32880.10.0homeobox gene 8
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]